Frames of Reference and the Intrinsic Directional Information of a Particle With Spin 
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"Information is physical", and here we consider the physical directional information of a particle 
with spin. We ask whether, in the presence of a classical frame of reference, such a particle contains 
any intrinsic directional information, ie. information above that which can be transmitted by a 
classical bit. We show that when sending a large number of spins, the answer is asymptotically 
"no". For finite numbers of spins, TV, we do not know the answer. We also show that any frame of 
reference which we can consider to be classical must use some resource which is exponentially large 
in TV. This gives a quantitative meaning to the idea that classical objects are big. 



I. INTRODUCTION 

In the last 20 years a conceptual revolution has swept 
through physics and computer science, beginning with 
the idea that " information is physical" . This is the idea 
that information is something which is encoded in the 
physical world, and has no existence without it. Since the 
physical world is in fact quantum mechanical, this must 
apply to information too, giving us quantum information. 

It has been usual in quantum information theory to 
view the physical system in which the information is 
stored as unimportant, viewing it simply as a carrier of 
information living in an abstract n-dimensional Hilbert 
space. Spin-i particles, pairs of energy levels, and pho- 
ton polarisations have all been treated in the same way, 
disregarding the fact that the first carries spatial informa- 
tion, the second carries time information, and the third a 
mixture of the two. Whilst it is often useful to ignore the 
differences, we feel it is time to treat the systems more 
carefully, taking into account the physical information 
they carry. We would like to understand exactly what 
is the content of the physical information, and what it 
can be used for. With this in mind, we looked at the 
spatial information carried in spin-i particles. We con- 
sidered the following problem. How well can one specify 
a direction in space by sending spin-i particles? By this 
we mean, suppose Alice wishes to tell Bob a direction in 
space, and is allowed to send Bob N spin-^ particles, in 
any state. How well can she do it? We shall compare: 

• N directional qubits (spin i particles). 

• N non-directional qubits. 

• N non-directional classical bits. 
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Intuitively the difference should tell us about the intrinsic 
directional information of the spins. 

One might want to consider also directional classical 
bits. The difficulty is that the very notion of a classical 
directional bit, although clear intuitively, is rather deli- 
cate and, as far we know, is not well established. One 
can send classical directional information by sending an 
arrow pointing in the desired direction. This however 
transmits an infinite number of bits of information. A 
finite amount of information would be transmitted by 
a "noisy" arrow, ie. by an arrow pointing in a direc- 
tion according to some probability distribution around 
the desired direction. But there are many different dis- 
tributions we could choose, such as a gaussian, or cos 2 
distribution, and we do not see why to choose one distri- 
bution over another. For this reason we shall not consider 
classical directional bits here. 

To make the comparison, we must also state whether 
Alice and Bob share any prior directional information. 
Consider first that Alice and Bob do not share any prior 
directional information. By this we mean that they do 
not begin with any shared frame of reference, ie. that 
there are no distant stars from which to fix a direction, 
or any other clues. If they were only allowed to send non- 
directional classical bits encoded for example as holes or 
blanks in a punch tape, they will not be able to send any 
directional information whatsoever. Nor would sending 
qubits encoded as directionless energy levels help. How- 
ever, since quantum spins point in some direction, they 
can be used to specify a direction in many different ways, 
and one can try to find the optimal method 0, IS El • 

In the first part of this paper we consider the case 
of perfectly aligned frames of reference. Using this it is 
possible to use classical bits to specify a direction, eg. by 
splitting the sphere into patches in advance, and using 
the classical bits to say within which patch the direction 
lies. The shared frame of reference also allows us to use 
the qubits as classical bits. Thus it is not necessary to use 
the directional information contained intrinsically within 
the spins in order to specify a direction. However, for 
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any finite number of bits/spins, the spins may be more 
efficient. That is, it may be possible, even with aligned 
frames, to use the intrinsic directional information con- 
tained within N spins to more precisely point a direction 
than if we were to use the spins simply as N classical 
bits. 

One might also compare the spins and classical bits 
to directionless qubits, such as energy levels, or time- 
bins. The idea would be that the difference between spins 
and energy levels tells us about the intrinsic directional 
information, whereas the difference between the energy 
levels and the classical bits tells us about the difference 
between quantum and classical. 

Our main results are that, with perfectly aligned 
frames: 

• N directional qubits are completely equivalent to N 
non-directional qubits (assuming that the appropri- 
ate non-directional frames of reference are perfectly 
aligned). 

• N qubits are, asymptotically in block coding, no 
better than N non-directional classical bits. 

This is surprising in one sense, since the spin can be 
placed in many possible states, and so seems to contain 
many bits of information. However it is not so surprising 
in the light of Holevo's theorem Q, since this tells us 
that a qubit cannot be used to transmit more than one 
classical bit of information (if there is no shared entan- 
glement). 

In the second part of the paper we will consider the 
case of non-perfectly aligned frames. For example, the 
frames could be specified by a fixed number of spins held 
by Alice, and similarly for Bob. We shall show that for 
the frames to behave classically when we send N further 
spins to specify a direction, the number of spins in the 
frames must be exponential in N. In other words, for a 
frame of reference to be considered classical, it must be 
exponentially big. 



II. DIRECTIONAL AND NON-DIRECTIONAL 
QUBITS ARE EQUIVALENT 

We shall now prove that, with perfectly aligned frames, 
N directional qubits are equivalent to N non-directional 
qubits. For clarity, consider a non-directional qubit en- 
coded in the energy levels of an atom. Furthermore, sup- 
pose that the energy frames are also aligned. By this we 
mean that Alice and Bob can agree on what the state 
(a \Ex) + \E 2 )) is. It is simple to agree on whether 
the qubit is in state \Ei) or \E 2 ). However in order to 
determine the relative phase, Alice and Bob will need to 
synchronize their clocks. 

With all frames aligned, Alice and Bob can agree upon 
a one-to-one mapping between the direction and energy 
frames. They then simulate the results of sending any 
spin qubit by sending an energy qubit. 



Once the frames are all aligned, a qubit is a qubit and 
there is no difference whether it is encoded as energy, or 
time, or direction. However, if we do not have perfectly 
aligned frames, this is no longer the case. Also, we are 
still left with the difference between the qubit and the 
classical bit, which we shall focus upon for the next part 
of this paper. 



III. QUBITS ARE ASYMPTOTICALLY 
EQUIVALENT TO BITS 

In order to state our second result properly, we must 
define what we mean by sending a direction precisely. 
For simplicity, we shall assume that Alice tries to send 
a direction h chosen from a finite set of directions, hi, 
according to probability p(h). We will insist that Bob 
must, at the end of the protocol, guess which direction 
was sent: we shall call his guess m, chosen from a set 
rhj . We shall give them some (bounded) score to say how 
well they do in any run, f{hi, rhj), and try to maximize 
the average score, averaged over many runs. We want 
to know whether they can get a better average score by 
sending N spin-i particles than they can by sending N 
classical bits. 

To allow for more generality, we allow the set of guessed 
directions to be different to the set of sent directions. One 
might wonder why one would guess a direction which was 
not sent. It could be useful to Bob if he is not sure which 
of two directions was sent, and so he guesses a direction 
inbetween the two, so that it is not too far from either. 

As is common in information theory, we shall find it 
simplest to perform the analysis in a block coding sce- 
nario. By block coding we mean that, rather than tak- 
ing one input direction, sending N spins/bits, and mak- 
ing one guess, we shall have Alice take a large num- 
ber, K, of directions, h k ; k — 1..K, each chosen from hi 
according to p(h), send KN spins/bits together, and 
have Bob make K guesses, fh k ;k = 1..K. We shall de- 
note Alice's block of K directions by h® K , and similarly 
Bob's block of guessed directions by fh® K . We will then 
have K pairs, (h k ,rh k ), and some probability distribu- 
tion, p(h® K , rh® K ), of input and output blocks of length 
K. The score will still be given by the single copy fidelity, 
depending only on pairs of directions, (h k ,rh k ), not on 
pairs of blocks, (n® , rh® K ). We shall be interested in 
the asymptotic limit of arbitrarily large block size, ie. 
K — » oo. 

In this asymptotic block coding scenario, we shall show 
that the best average score we can obtain with N spins 
can also be (arbitrarily closely) obtained using N classi- 
cal bits. We shall prove this in three stages. We shall 
first show that we can use block coding of classical bits 
and the shared classical randomness to arbitrarily closely 
simulate the probability of each pair of directions (hi, rhj) 
that occurs in any single copy protocol using N spins. 
We show from this that the block coded classical pro- 
tocol can obtain a fidelity arbitrarily close to the opti- 



3 



mal single copy quantum protocol. We shall finally show 
that classical block coding can also get us arbitrarily close 
to the optimal quantum block coded protocol. Thus, in 
the block coding setting, the intrinsic directional informa- 
tion of spins is useless if Alice and Bob share pre-aligncd 
frames. 

One would also like to know what happens in the sin- 
gle copy case. It would be simplest if, here too, we could 
mimic the spins using classical bits, and thus show that 
the spins contain no useful intrinsic directional informa- 
tion. However, we do not know whether or not this is 
possible. Even if this is not possible, it may be possible 
to show that the optimal average score for N spins can be 
obtained using N classical bits. Unfortunately we do not 
know whether or not this can be done in general. We do 
know that, provided the number of possible directions is 
the same as the number of different signals we are allowed 
to send {2 N ), it can be done for the most natural mea- 
sure of success, the classical mutual information between 
the sent and guessed directions, 



I(h, rh) = H(rh) — H(fn\h), 



(1) 



where the entropy, H, is given by the usual formula 

H ( fi ) = - S Z2v{hi)log 2 p(h l ). (2) 



We know this because Holevo's theorem Q tells us that 
using N spin-i particles we cannot create more than N 
bits of mutual information, something we can do with 
N classical bits. We leave open the other single copy 
fidelities, and shall use the remainder of the paper to 
show that, if we allow block coding, the spins are no 
more useful than classical bits. 



IV. SIMULATING THE QUANTUM 
PROBABILITIES 



Before giving the classical block coding protocol which 
arbitrarily closely simulates the single copy quantum 
probabilities, we define the typical set. The key ele- 
ment of ShannonQ and Schumacher^ compression is 
that if we look at many samples from a distribution, we 
are almost certain to find a sequence which lies in the 
weakly typical (entropy typical) set. This is the set of 
all sequences for which the logarithm of the probability 
is close to the entropy of the distribution. This set is 
much smaller than the set of all possible sequences, and 
so elements within it can be described by much fewer bits 
than would be required to describe an arbitrary sequence, 
giving us compression. 

In order to prove that our protocol works, we need 
to use a different sort of typical set, the strongly (or fre- 
quency) typical set. This is the set of all sequences which 
have frequencies of each outcome very close to the proba- 
bility of that outcome. Intuitively, one expects that if we 
take many samples from a distribution, the samples will 



almost certainly form a sequence within this frequency 
typical set. This is indeed the case. This set is quite 
similar to the weakly typical set, but is slightly smaller. 
The compression properties of the two sets are asymp- 
totically the same. We only use the strongly typical set 
because having the frequencies of all the outcomes close 
to the probabilities is very useful. Though well known 
in classical information theory, this kind of set is only 
beginning to appear in quantum information theory (see 
eg- 0). 

More precisely, the strongly typical set, A € K , is the set 
of blocks [h® K , rh® K ) such that 



#{hi,rhj) 



K 



< 



,V(i,j) 



(3) 



where #(hi,rhj) is the number of pairs of directions 
(h k ,rh k ) in the pair of blocks (h® K ,rh® K ) which point 
in the directions (hi,rhj), and \i\, |j| are the numbers of 
directions hi and rhj respectively. 

Now, we shall give the protocol. Later we shall explain 
why it works. 

Step 1: Alice and Bob agree in advance a large ta- 
ble, which is created by random sampling. Each entry 
in the table is actually an ordered list of K guessed di- 
rections, which are independently identically distributed 
(i.i.d.) according to PQ(rhj) — ^2nPQ{ni,rhj). The ta- 
ble has 2 if ( / ("' m ) +e ) entries, where I(h, rh) is the mutual 
information between h and rh in the quantum protocol. 
Thus it will take KI(h, hi) bits for Alice to specify to Bob 
a particular entry in the table. Since I(h, rh) < this 
will take at most KN bits, which is precisely the number 
of classical bits she is allowed to send. 

Step 2: Alice is given some list of K directions, h® K , 
which arc independently identically distributed according 
to p{h). 

Step 3: She looks at the table to see if there exists 
some entry, rh® K , such that (n® ,rh® K ) <E A e K (h,rh). 

Step 4: If she finds such an entry, she sends its index 
to Bob. If there is more than one, she picks the first such 
one. If there is no such entry, she sends the index 1 to 
Bob. 

Step 5: Bob uses the list of K guessed directions which 
Alice has pointed out to him. 

Because the fidelity is single copy, we are interested in 
the single copy probabilities of pairs of directions which 
this classical procedure produces. By single copy prob- 
ability we mean the probability of the pair of directions 
at the k th position in the block being (hi, rhj), ignoring 
(ie. summing over) the possible outcomes in all the other 
positions in the block. Since the protocol is symmetric 
between positions in the block, it does not matter which 
k we look at. We shall show that, by taking K suffi- 
ciently large, this procedure gives classical single copy 
probabilities arbitrarily close to the quantum ones. ie. 



\pc(hi,rhj) -pQ(hi,rhj)\ < 



>V(*,j), (4) 
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where pc(hi,fhj) is the classical single copy probability. 

The details of the proof that the protocol does this 
follow closely Section 13.6 of Cover and Thomas [lflj . 
We just give a sketch here. 

The proof is based upon the idea that pairs of blocks 
(n® K , rh® K ) € A e K (n, rh) have frequencies of pairs (n, rh) 
within e of the quantum probabilities, pq(h, rh). Thus, if 
we are almost certain that the protocol will give us input 
and output blocks of directions which are in the typical 
set, then the probability of a pair of directions (h, rh) 
appearing will be within e of the quantum probabilities. 

Now, the list of directions, h® K , which Alice is given 
is almost certain to be a strongly typical sequence, ie. a 
sequence in the set 



A* K (n) 



K 



< 777, Vj 

m 



(5) 



where j^hi is the number of directions h\ in the sequence 
n® K which point in the directions hi, and |i| is the num- 
bers of directions hi. 

Next, for any h® K £ A e K (h), if we add a sequence 
of guessed directions rh® K which are independently dis- 
tributed according to pQ^rhj), then the probability that 
the pair forms a jointly typical sequence, (h® K , fh® K ) e 
A e K (h,rh), is approximately 2~~ KI ( n,m \ 

Finally, if we take 2 Ar ( / (™>™) +e ) sequences of guessed 
directions in our table, we are almost certain to find one 
which is jointly typical with h® K . 

Thus our protocol indeed gives \p(rii,rhj) — 
PQ(hi,rhj)\ < |j^j,V(i,j). The difference between 
the quantum fidelity and our classical fidelity is given 
by: 

\Jq - fc\ < ^2 \f{n t ,rh j )\.\p Q {h l ,rh : j) -p c {hi,rhj)\ 

< ef max . (6) 

Thus our classical block coding protocol gives an aver- 
age fidelity arbitrarily close to the optimal single copy 
quantum one. 

Before using this result to show that classical block 
coding is as good as quantum block coding, we note that 
this classical procedure for producing joint probabilities 
is very general, and may be useful outside this context. It 
is a protocol where Alice takes one of various inputs, and 
sends Bob a sample of one of various probability distri- 
butions, which one depending upon Alice's input. In this 
sense it is somewhat like remote state preparation 
and for this reason we call it, "remote distribution prepa- 
ration" . Our protocol only sends I(n, rh) classical bits in 
order to do this. This amount is optimal since Bob learns 
rh) bits about Alice's source from his output, and so 
any procedure using less than I(n, rh) bits would allow 
us to send information faster than light [l2|. 

We note also that the shared table is quite large, and 
was created randomly. Thus one could ask if we might 
be using shared randomness, an additional resource. The 



difference between shared randomness and shared in- 
structions is that we can use the shared instructions many 
times without problems, whereas shared randomness is 
used up. A simple example is an unknown bit: the first 
time we look at it it is random, the second time it is 
the same as the first time, and so is no longer random. 
What happens if we use our table many times, ie. to 
mimic many sets of KN qubits? If we just look at the 
statistics of the spins individually, ie. of the individual 
pairs (n, rh), we will not see any problems. If however we 
look at the statistics of the blocks, (h® K , rh® K ), we shall 
see that the blocks are correlated between one run and 
the next. We may need many runs to see this, but it will 
happen eventually. This correlation would not exist if we 
took a fresh table each time, ie. had shared randomness. 
Fortunately we are only interested in the statistics of the 
individual pairs, and so can use one shared instruction 
table again and again without problem. 

As the table of shared instructions is very big, we also 
wonder if the same task can be performed with a smaller 
table. We do not know whether this is possible. 

To deal with quantum block coding first note that since 
the fidelity is bounded, there must be a quantum block 
code of finite length, M say, which gives an average fi- 
delity within e of the optimal (infinite length) one. For 
this M, we can take a classical block code of length KM 
which gives an average fidelity within e of the quantum 
code of length M. Thus we have a finite classical code 
which gives an average fidelity arbitrarily close to the in- 
finite length quantum block coded one, making the quan- 
tum and classical schemes asymptotically equivalent. 



V. RELATED QUESTIONS 

We have shown that when 2 parties have pre-aligned 
frames, the intrinsic directional information contained 
within spin-i particles is of no use for pointing a direc- 
tion in space, at least if we allow block coding. We may 
as well use the spins as classical bits. An open question 
is whether or not there is a difference for finite numbers 
of bits. 

Another question is whether the presence of unlim- 
ited shared entanglement would help us to use the di- 
rectional part of the quantum spins. It will certainly 
help us speci fy t he direction more precisely: using super- 
dense coding[jj|] we can send 2 classical bits of informa- 
tion with just 1 spin-i particle. If we use classical bits 
to specify the direction entanglement will not help us at 
all. However, this is just the normal superdense coding, 
and is not intrinsically directional. To see whether there 
is anything more we should compare N spins with 2N 
classical bits (both in the presence of shared prior entan- 
glement). Doing this, we can use the fact that N spins 
and shared entanglement cannot create more than 2A^ 
bits of mutual information between Alice and Bob's di- 
rections, combined with our earlier results, to show that 
2N classical bits can get an equally good average score. 
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Thus the presence of entanglement does not help unlock 
any directional information. Once we have aligned our 
frames of reference, we may as well treat spins as direc- 
tionless qubits. 

VI. CLASSICAL FRAMES ARE BIG 

Finally, we shall argue that classical objects, in partic- 
ular those which specify frames of reference, need to be 
very big. Recall that with a shared frame of reference, 
one can specify a direction in space using N spin-^ parti- 
cles with a fidelity Fjv ~ 1 — c ° 2 r ^ t ' . If one has no shared 
frame of reference, N spins only specify a direction up to 
F N ~ 1 — co ^f' 0>E1- Put another way, without a shared 

JV 

frame of reference one needs around 2~ spins to do the 
same job as N spins would do with a shared frame. Now, 
the classical objects which define the frame can be con- 
sidered to be M spin-i quantum mechanical particles 
with Alice, and M with Bob. We assume that Alice's 
object is not entangled with Bob's. Since we could try 
sending a direction without a pre-aligned frame by send- 

N 

ing N + M spins, it must be that M > 2~ — N. Hence a 
"classical" frame used to extract the full directional in- 
formation from N spins must itself consist of around 
spins: an exponential number. 

Of course, this is not the only way to specify a frame: 
one could also use two atoms a large distance apart to 
specify the direction. However we still have an expo- 
nential use of resources, either in the increasing distance 
between the two atoms, or in the increasing momentum 
of the atoms (which must be very uncertain to precisely 
define the position of the atoms). This makes sense: ev- 
ery time we double the precision we require twice as many 
resources. 

For current measurements the exponential growth is 
not a problem. In astronomy, angles in the x-y plane 
can be measured with a precision of 10 radians, and 



can be specified using a shared frame of reference and 34 
classical bits. To specify such an angle using spins, we 
need a total angular momentum L > AL > -ra . Since N 
spins have L = iV§, we only need around 10 11 spins. If 
we made the frame from two atoms, they could each be 
localised to 10 atom widths (=10~ 9 m), and located 100 
meters apart. 

Whilst these frames are relatively small, they are leav- 
ing the realm of the microscopic. As we probe the phys- 
ical world more and more closely, we can expect to see 
our probes getting bigger and bigger. 
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Note Added: there are several other papers 0, Q, 
Il5j which contain protocols closely related to the classical 
protocol in this paper. For example |Q| contains the re- 
lated result that one can, using block coding and shared 
randomness, arbitrarily closely simulate a classical noisy 
channel of capacity C using a classical noiseless one of 
the same capacity. A noisy channel is defined by p(y\x), 
which is the probability that the channel gives output y 
when the input is x. Its capacity is given by 

C = max p{x) I(x,y), (7) 

where the maximum is over our choice of how to use the 
channel, ie. the probabilities of the various inputs p(x). 
This is essentially the same as the problem of Alice tak- 
ing some input directions according to some distribution 
p(x), and trying to give Bob the output directions ac- 
cording to p(y\x), sending only I(x,y) bits of classical 
information (down a perfect channel). Despite the simi- 
larities, the aim our paper (to investigate the intrinsic di- 
rectional information of spin-i particles), and theirs (to 
investigate the classical capacity of quantum channels) 
were quite different. 
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